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Abstract — We propose a new write channel model for bit- 
patterned media recording that reflects the data dependence of 
write synchronization errors. It is shown that this model ac- 
commodates both substitution-like errors and insertion-deletion 
errors whose statistics are determined by an underlying channel 
state process. We study information theoretic properties of the 
write channel model, including the capacity, symmetric informa- 
tion rate, Markov- 1 rate and the zero-error capacity. 

Index Terms — Bit-patterned media, High-density magnetic 
recording, Channel capacity, Symmetric information rate, 
Markov- 1 rate, Zero-error capacity. 



I. Introduction 

MAGNETIC recording channels are typically modeled 
as binary-input intersymbol interference (IS I) channels, 
also called partial response channels (2j, (3). An implicit 
assumption made in these channel models is that the data are 
correctly written on the disk and that errors occur only during 
the readback process. While this is a realistic assumption in 
conventional recording on continuous media, it is questionable 
in the context of certain advanced recording technologies, 
such as bit-patterned media (BPM) recording, that may be 
required to achieve higher storage densities. In this paper, we 
will examine some of the underlying causes of errors in the 
recording process, particularly in BPM recording, and then 
propose a new probabilistic write channel model that captures 
some of the data dependence of these write errors. Thus, the 
input to this write channel model is the data sequence to be 
recorded and the channel output is the "noisy" sequence that 
actually gets stored on the medium. This leads to a description 
of the full data recording and readback process as a cascade 
of an imperfect write channel and a noisy (partial response) 
readback channel. 

Hu et al. [4] proposed a model for the BPM recording chan- 
nel in which the write channel was a binary symmetric channel 
(BSC), and the readback channel was a linear, intersymbol- 
interference (ISI) channel with additive noise. They proposed 
and evaluated detection methods for this channel, and they 
investigate achievable information rates for such a system. In 
|(5), we considered an idealized cascaded channel model in 
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which the write channel was again a BSC and the readback 
channel was a memoryless, binary-input, additive white Gaus- 
sian noise (AWGN) channel. We studied theoretical properties 
of this channel, as well as the decodable regions of LDPC 
codes under a number of decoding algorthms. In this paper, 
we focus on a new write channel model, and we determine 
and compare bounds on several relevant information-theoretic 
limits: capacity, symmetric information rate (SIR), information 
rate with first-order Markov inputs (Markov- 1 rate), and zero- 
error capacity. 

The remainder of the paper is organized as follows. In 
Section [TTJ we start with a brief description of the write 
process in bit-patterned media recording, highlighting the main 
factors leading to write errors. We review some useful math- 



ematical notation in Section III-A and then present the new 
probabilistic, data-dependent write channel model in Section 



III-B We discuss the types of write errors that can occur - 
including substitution-like errors and insertion-deletion errors 
- and explain their connection to the channel state process that 
underlies the model. Two classes of channel state processes are 
introduced: a Bernoulli state process (Section III-B li and a 



binary Markov state process (Section III-B2 1. In Section III-C 
we make the observation that, although proposed as a model 
for BPM recording, the Markov state channel model is also 
relevant to high-density magnetic recording using conventional 



granular media. In Section IV we give bounds on the capacity, 
the SIR, and the Markov- 1 rate of the Bernoulli state channel. 
Section [V] gives similar bounds on the capacity for the binary 
Markov state channel. The SIR is numerically computed for 
both of the channels considered in Section Hill In Section 



VI we introduce a generalization of binary Markov state 
channels, namely the A"-ary Markov state channels. For one 
such generalized channel, we numerically estimate the SIR 
and derive bounds on the channel capacity. Finally, in Section 



VII we explore the zero-error capacity of the proposed class 



of write channel models. Section VIII provides concluding 
remarks. 



II. Bit-Patterned Media Recording 

A conventional magnetic recording medium is a continuous 
film of magnetic grains that coats the surface of the disk 
substrate. Each grain is an atomic magnetic unit that assumes 
one of two possible magnetic states. A group of grains together 
form a bit-cell, an entity that stores one bit of information. 
Therefore, as the areal information density is increased, the 
number of grains forming a bit-cell reduces. One of the prob- 
lems with high-density magnetic recording in conventional 
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media is the super-paramagnetic effect, wherein the magnetic 
states of individual grains change due to the influence of 
neighbouring grains or due to changes in temperature. When 
the areal information density is increased to point where there 
are only a few grains per bit, such uncontrolled changes 
in the magnetic states of grains are detrimental to reliable 
information storage. 

Bit-patterned media recording (BPMR) proposes to get 
around this problem by making use of patterned magnetic 
islands separated by non-magnetic material [6|. However, this 
new structure of the magnetic medium introduces technical 
challenges not seen in recording on conventional media. An 
immediate requirement of this media structure is near-perfect 
synchronization of the write process to ensure that the write 
head is positioned correctly over the magnetic islands on this 
disk, i.e., to ensure that the head is positioned within the 
so-called writing window zone of the islands [7|. Assuming 
that this write synchronization is achieved through the use 
of timing synchronization patterns, there is a possiblity of 
frequency and/or phase mismatch leading to incorrect writing. 
Furthermore, even without a timing mismatch, imperfections 
in the configuration of the patterned magnetic islands may 
cause writing errors. Finally, as in conventional magnetic 
recording, the switching field distribution of magnetic grains 
may contribute to errors in the write process. We will refer to 
write errors induced by any of these mechanisms as written-in 
errors. 

Another important feature of BPMR is the geometry of the 
writing process. Along the down-track direction, the span of 
influence of the magnetic write field is typically larger than 
the spacing between the islands. Therefore, at any given time, 
the write head influences multiple adjacent islands. So, in the 
process of recording a bit on a specified island, the bit value 
is also recorded on a certain number of subsequent islands, 
with these islands themselves being overwritten in the future 
by subsequent bits. Fig.[T]gives an illustration of this idealized 
write process. We refer to the number of subsequent islands 
influenced by the write head as the writing span, D. We will 
assume throughout this paper that D > 1, which implies that 
write process has memory and, as a consequence, the write 
channel is data-dependent. 



III. Write Channel Model 



A. Notation 



We will denote random variables using capital letters X, 
Y, Z; random vectors as X\ = (JQ, JQ+i, • • • ,Xj),i, j G N 
when i < j, a null variable when i > j or when j is non- 
positive. Infinite-dimensional random vectors are considered 
to be discrete-time random processes and are denoted using 
calligraphic letters X, y, Z. The alphabets over which ran- 
dom processes are defined are denoted by ^, Y, Z. Unless 
otherwise stated, we will denote the channel input variable, 
process and alphabet using the letters X, X and X respectively. 
Similarly, Y, y, Y will correspond to the channel output, and 
Z, Z, Z to the channel state (Z is not to be confused with 
the set of integers Z). For a real number a g [0, 1], we define 
a = 1 — a. 
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Fig. 1. Example illustrating writing one bit on each island in BPMR with 
a write head of writing span D = 3. Depicted here are snapshots of the top 
view of the disk and write mechanism. Initially, all islands, represented here 
using gray squares, have unknown magnetic states represented by X, which 
could be or 1. At the first time instant, the write head, shown here using a 
dashed rectangle, is positioned over the first four islands. In this position, the 
first island is written with the data bit 0. The three subsequent islands are also 
written with the first data bit 0, before being overwritten with their own data. 
After each bit is written, the head is moved over to the next island along the 
down-track direction. At each position of the write head, the corresponding 
"current" island is highlighted with a black square boundary. 



B. Model 

We assume that the input, output and the channel state 
alphabet is the binary set ^ = Y = Z = {0,1}. The write 
channel model can be written as 



Yi = X i ®{X i ®X i _ 1 )®Z i 



(1) 



where the state process Z is independent of the inputs and the 
outputs, and and <g) represent addition and multiplication in 
GF(2), respectively. The channel state Zi is not to be confused 
with the magnetic state of the island. 

It is natural to assume that the channel input and output 
alphabets are binary since the (intended and actual) magnetic 
states of the islands can be one of two possible states. The 
channel state Zi is a random variable that represents a failure 
in writing due to one of the conditions mentioned in the 
previous section, i.e., when Zi = 1 the write head can be 
assumed to have failed in writing the intended bit. However, 
this does not necessarily imply a written-in error because there 
would be no error if the bit to be written is the same as 
the existing magnetic state of the island. This is captured by 
the term (Xi ® that is multiplied with Zi in ([TJ. This 

"noise" term {Xi®Xi_\)(g)Zi is justified because we assume 
that the timing mismatch or the irregularity of island patterns 
can cause the write head to be positioned, in the worst case, 
on the island immediately following the correct island. We 



will continue with this assumption until Section VI where we 



construct a more generial write channel model. Also note that 
when the first bit is written late, i.e. when Z\ = 1, we have 
Y\ = Xq which represents the pre-existing magnetic state of 
the first island. We will assume that X is equally likely to be 
a or a 1. Similarly, when the last bit is written late, the last 
island has a bit in error if the last and the penultimate bits are 
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different. In either case, n magnetic islands are read and their 
contents are interpreted as the n data bits so that there is no 
blocklength mismatch. 

Based on the Zi sequence, the channel in ([JJ) appears 
to produce different types of errors. When the Zi sequence 
consists of isolated ones, the channel appears to produce 
substitution errors. This is illustrated in the Fig. [2] Such 

Zi 0010010000 
X t 1 1 1 1 



Yi 0001001100 



Fig. 2. An example of substitution-like errors produced by the write channel. 
The sequence Zi gives the channel state for each magnetic island. In this 
example, Zi = 1 for the third and the sixth islands. The sequence X, gives 
the intended magnetization of the islands, i.e., the data to be written. Taking 
into account the channel states Zi , the sequence Yi shows the resulting island 
magnetizations. Note that substitution-like errors occur only when Zi = 1, 
as was the case with the sixth island here, highlighted with a box around the 
island in the Yi sequence. However, not every Zi = 1 results in an error, as 
illustrated by the third island. 

substitution-like errors can occur when the write head misses 
islands at random and independently of its success in writing 
on previous islands. Noting that when Zi = 0, Yj = so 
that the output reproduces the input exactly, and when Zi = 1, 
Yi = Xi-i so that the output reproduces the input with a delay 
of one time instant, we can see that another way to write the 
relation in ([TJ is 

Yi=Xi_ Zi . (2) 

Now, when the Z t sequence consists of long runs of ones, the 
channel appears to produce paired insertion-deletion errors, 
with insertions accompanying — >• 1 channel state transitions 
and deletions accompanying 1 — >• channel state transitions, 
as shown in Fig. [3] These insertion-deletion errors can happen 

^ 0001111000 
X t 1 1:1:1 




Yi 0000101100 



Fig. 3. An example of an insertion-deletion error. Due to the channel state 
sequence Zi, the channel inputs are transformed as shown by the arrows 
between the Xi and Yi sequences with the resulting magnetic states of 
the islands shown in the sequence Yi. The inserted bit in the Yi sequence 
accompanying the channel state transition — > 1 is shown with a box around 
the corresponding island. The deleted bit from the Xi sequence accompanying 
the channel state transition 1 — > is shown with a dotted box around the 
island. 

as a result of timing synchronization errors, wherein there is 
a frequency mismatch between the islands and the write head; 
or as a result of a group of islands being separated farther than 
usual or having larger switching fields. 

The channel in ([TJ and (|2) is completely defined by specify- 
ing the statistics of the Z process. We will consider the channel 
under two different statistical assumptions on the Z process 
in the following, and show how this difference in statistics 
changes the typical behaviour of the channel. 



1 ) Bernoulli state channel: When the channel state process 
Z is an i.i.d. Bernoulli(p) process, i.e., Z ~ £>(p), we will 
call the channel the Bernoulli state channel. In this case, 
the channel is completely specified by the parameter p. We 
shall henceforth denote this channel by W(p). Typically, for 
small values of the parameter p we can expect this channel to 
produce errors resembling substitution errors (See Fig. Ej). 

2) Binary Markov state channel: When the channel state 
process Z is a first-order binary Markov process^E ~ 
Mf ] ( Pi ,Pd), i.e., P^lZj- 1 } = ¥{Zi\Zi-x}, F{Z { = 
\\Zi-x = 0} = pi and F{Z t = 0|Z;_i = 1} = p d , we will 
refer to the channel as the binary Markov state channel and 
denote it by FK(pi , Pd)- As noted earlier, we can expect such 
a channel to typically produce paired insertion-deletion errors 
(See Fig. [3}. Hence, the parameters p; and p<j can be thought 
of as insertion and deletion probabilities, respectively, of the 
channel. 

We note here that this channel differs from the most general 
insertion-deletion channel in two ways. First, the inserted bit 
is always the same as the last written bit. Whereas this seems 
to be a serious limitation of the model in comparison with 
the general insertion channel model, it is to be noted that 
in most practical insertion-deletion channels — channels with 
synchronization errors, or sticky channels [8| — the inserted 
bit is usually the one last written (or transmitted), rather than 
being a random bit. Second, the insertions and deletions are 
paired so that one never sees two consecutive insertions or 
deletions. While this seems to limit the scope of the channel, 
a straightforward extension to channels with a finite maximum 
number of consecutive insertions or deletions is possible. 
However, the estimation of the channel characteristics becomes 
more complex as the maximum number of consecutive in- 



sertions increases (See Section VI I. Nevertheless, the binary 



Markov state channel gives us good insight into more general 
insertion-deletion channels. Moreover, unlike much of the 
previous work in the literature wherein channels cause only 
insertion 1 8 1 errors or only deletion ||9)-|fT4) errors, this model 
considers both insertions and deletions in the same setting. 
Among works that handle both insertions and deletions, the set 
up in this paper resembles the duplication-deletion channel of 
p5| more than it does the generic insertion-deletion channel 
of [ fl6| ]. However, the channel model considered here differs 
from those in the aforementioned papers in that the insertion- 
deletion process has memory, i.e. the insertions and deletions 
are not i.i.d. 

C. High-density recording with granular media 

The channel model in ([T} can also be used to describe high- 
density magnetic recording on conventional granular magnetic 
media flT) . Although media granularity is typically considered 
as a two-dimensional phenomenon, we consider granularity 
only in one-dimension - along the down-track direction, and 
assume adjacent tracks to be independent. This simplification 

'The superscript in the notation M? (PbPd) denotes the alphabet size 
over which the process is defined and the subscript denotes the memory in 
the process. The two arguments give the transition probabilities between the 
two states. 



allows us to establish lower bounds for performance over the 
two-dimensional channel as proposed in flT) . 

Media granularity in one-dimension results in written-in 
errors as follows. At storage densities of the order of 1 bit 
per grain, the variation in grain size plays an important role in 
deciding the reliability of data storage. In this case, bit-cells 
are at most as large as individual grains. We assume that the 
grains are all 1 or 2 bit-cells in size, and that the magnetic 
state of each grain is decided by the last bit written on them. 
This is depicted in Fig. |4] Grains that are 2 bit-cells large will 
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Fig. 4. Writing on granular media. Individual bit-cells are shown as x's 
in the first row, corresponding to the magnetic states read from these cells, 
which could be 0's or l's. Also shown in shades of gray are the magnetic 
grains: grains that comprise one bit-cell are shown in light gray and those 
that comprise two bit-cells are in a darker shade of gray. The two bit-cells 
comprising a grain of size 2 always have the same magnetic state. As in 
the case of BPMR, the write head spans multiple bit-cells, as shown by the 
dashed rectangles. The "current" bit-cell at each time instant is shown with a 
black square. 
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Fig. 5. Written-in errors due to media granularity. The Z process represents 
the grain pattern with Zi being 1 when the corresponding bit-cell is the first 
bit-cell in a grain that comprises two bit-cells. The Xi sequence shows the 
data to be written. The arrows between the Xi and Y; sequences show the 
transformation of information according to the grain pattern, resulting in the 
sequence Yi being stored. The written-in errors are shown with a box around 
the corresponding bit-cell. 



on obtaining bounds on the achievable information rates for 
the general channel in ([T| in the context of the two channel 



state processes defined in Section III-B 



IV. Bernoulli State Channel 

The channel space for the Bernoulli state channel defined 
in the previous section is parameterized by p e [0, 1]. Since 
the channel has memory, its capacity is given as 

C= Urn sup ~I(X?',Yfl. 

All the entropy and mutual information terms calculated are in 
bits, so that the capacity is always measured in bits per channel 
use. We will denote by C(p) the capacity of the Bernoulli state 
channel with parameter p. 

Proposition 1 (Channel symmetry): C(p) = C(p). 

Proof: Let us denote by l p (X";Y 1 n ) the mutual infor- 
mation between the vectors X™ and Y" when the channel 
parameter is p. Note that the channel output can be simulta- 
neously written as 

Yi = (Xi ® Zi) 8 (Xi-! ® Zi) 



result in written-in errors if the two bits written on them are 
different. Fig. [5] gives an illustration of the written-in errors 
in this scenario. Comparing Figures [3] and [5] it is easy to see 
that the channel model in this case can be written as 

Yi =Xi@(Xi®Xi +x )®Zi. 

Using a simple time-reversal argument, it can be seen that 
this is exactly the channel in ([TJ. However, in this case, the 
Zi sequence cannot have two consecutive ones, i.e., it satisfies 
the (l,oo) run-length constraint [18, Chap. 4]. We choose to 
model this as the W(p\, 1) channel since any realization of the 



./Vf i (pi, 1) process meets the run-length requirement. In 1 19 



the authors consider this channel and derive upper bounds for 
the achievable rates over the channel. Our focus here will be 



and 



Yi = (Si <g> Z^ (S q 



+i 



Zi) 



where Si — -Xi_j and Z,; = Zi ® 1. Thus 

Ip{Xi;Yi\X\ ) = I-p(S n -i+i;Yi\Sn_ i+2 ) 

for all but a vanishing fraction of indices i £ N, as n 
Therefore, 

C(p) = lim sup -I p (X?;Y?) 

rwoo p{x „ } n 
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= lim sup — 

™^°°p{x 1 "} n 



1 = 1 



i^-Y-ixr 1 ) 



i 

lim sup - >J/p(S' n _i + i;F 1 "|S', 



n-i+2J 



i=l 



1 



= lim sup -7p(S , 1 ™;Y 1 ") 

- lim sup i/pCxr" 1 ;^") 



'F{Jtf} 

lim sup i/p^y^) 



C(p). 



The channel space can therefore be reduced to the interval 
p € [0, |]. Further note that the same symmetry argument 
holds for not just the rate-maximizing input distribution, but 
for all input distributions. ■ 
The capacity of the Bernoulli state channel is upper bounded 
by the achievable rate for a genie-aided decoder, i.e., one 
with the Z process realization known. Given the realization 
of the Z process, the inserted bits and the positions of the 
deleted bits are known so that the Bernoulli state channel is 
equivalent to a correlated erasure channel with average erasure 
rate P{Zi_i — 1, Z t — 0} — pp. The resulting erasure channel 
is a correlated channel since, erasures being dependent on 
1 — >• transitions in the Z process, two consecutive bits cannot 
be erased. Therefore, 



C(p)<l-pp = C se (p) 



(3) 



since the capacity of a correlated erasure channel is the same 
as that of a memoryless erasure channel with the same erasure 
probability. We call this upper bound, C gE (p), the genie- 
erasure capacity of the channel VK(p). 
Consider 

-i(x\ 1 -y; 1 ) = -h(y; 1 ) - -h{y?\x\ 1 ) 

n n n 



(a) 1 
n 



n 

H{Y?)--Y,H{Yi\XU) 



i=l 



-h(y; 1 ) - ^Vp{i, + Xt-j}, (4) 

n n * — ' 

i=l 



where h 2 (-) is the binary entropy function |20|. The equality 
labelled (a) follows from the definition of Yi in ([TJ and the fact 
that Z ~ B(p). Since the information-rate-maximizing input 
distribution is unknown, we will now derive lower bounds to 
the capacity by making certain assumptions about the statistics 
of the input process X. 

A. I.i.d. input process 

We will first assume that the input process X is an i.i.d. 
B(a) process. With this assumption, the maximum achievable 
information rate, called the i.i.d. capacity, denoted Cud{ct, p), 
gives a lower bound to the capacity. 

Proposition 2 (Input symmetry): Cud((x, p) = Cad(a, p). 
Proof: Let X™ and Y™ be the input and output respec- 
tively of the channel W(p). Define X™ = (X 1 ®1,X 2 ® 
,X n ®l) andYf = (Yi®l,y 2 ©l,--- ,Y n ®l). When 



X? ~ B(a), XI ~ B(a). Further since X? O X? and 
Y" f-> Y™ are bijections, we have 

/p(xp ; y 1 ") = 7(x«;y 1 "). 

Clearly, X™ and Y" also satisfy the relation in ([TJ and 
consequently 

I{X?-,Y?) = I p {X?-Y?)- 
From this and the fact that 



C iid (a,p) = lim -/ p (X™;Y") 

n— >oo n 



we have the desired result. ■ 
As a consequence of Propositions [JJ and [2] we have 

Cnd(a,p) = C iid (a,p) = C U d(a,p) = C iid (a, p). 

From Proposition [2] and the fact that Ca d (a, p) is concave 
in a, we immediately have the following. 

Corollary 3 (Rate-maximizing i.i.d. distribution): 

max C ild (a, p) = C lid {\, p) V p € [0, 1]. 
ae [o,i] 2 

When the binary input is i.i.d. with a uniform distribution 
(sometimes abbreviated as i.u.d.), the corresponding rate is 
called the symmetric information rate (SIR) and is denoted 
Ciud{p)- The SIR is of interest because it can be achieved by 
a random linear coset code pT| . 
We have from Q 

Hp) 



c(p) > c lud {p) = n(y) 



(5) 



We can lower bound the SIR by disregarding the data- 
dependence of the noise in the channel. This gives a channel 
equivalent to a BSC with crossover probability p/2 so that 



C iud (p)> l-h 2 (^)=L™ d (p). 



(6) 



Further lower bounds can be obtained by conditioning the 
entropy of the output as follows: 



n 

c iud {p) = lim -V/ftyiy;- 1 ) 

n, — vcyc> n ' * 



Hp) 



> lim ^EWYt 1 ,^)-^ 



i=l 

1-P 



Hp) A Tlud 



L i (p), 



(7) 



where we have used the fact that Yi depends only on and 



Xi, and given Xi-\, Yi is independent of Y[ . Continuing 
as above, we can obtain a tighter lower bound 

c iud ( P ) > lim - Vi^yiy/- 1 ,*^) - ^ 



1 



2 

1-P 



h 2 
h 2 



1 



2(1 + P) 
1-P 



^ ± L*«<(p). (8) 
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A straightforward upper bound for the SIR, implied by |5]) is 

(9) 



n ( n \ <r- 1 ^(Pj A TT iud(\ 

^iudiP) < 1 — = U Q (p), 



which follows because the entropy rate for a binary process 
H{y) < 1. Note that this bound is achieved when y is the 
i.i.d. $(1/2) process. Again, starting from (|5j, we can obtain 
upper bounds for the SIR by removing conditioning from the 
entropy of the output, as shown below: 



1 " 

C iud (p) < lim - Y,H(Yl\Yi-i) 



^(l^PEV^^^P). do) 



As with the lower bounds, we can find a tighter upper bound 
for the entropy rate H(y) as follows 



n 

c iud ( P )< lim -V^y.iF/- 1 )- 



Mp) 



i - pp. 



2(1 - pp) 



1 + PPi 



1 



2(1 + pp) 



MP) A ^ 
(11) 

Fig. [6] plots the lower and the upper bounds for SIR 
discussed above. Note that the tighter lower and upper bounds 




Fig. 6. Lower and upper bounds for SIR given in j6j, j7}, j8j, JTo} and 
\i I) , and the upper bound for capacity in J5J. The SIR from simulations is 
also shown. 

— L 2 ud {p) i R © an( l U2 Ud (p) in ( fTTj ) — almost coincide for 
p < 0.3 (and from symmetry, for p > 0.7). In this range, 
therefore, where the bounds themselves are greater than 0.5, 
they approximate the SIR fairly accurately. The process y is 
ergodic, as all channel states can be reached within a finite 



number of steps at any time with strictly positive probability 
J22} §5.3], and it converges to a stationary process. As a result, 
from the Asymptotic Equipartition Property [20 §3.1], we 
have the entropy rate 

H(y)= lim -H(Y 1 n ) = - lim -log 2 F{Y?}, 

n— foo Tl n— >oo Tl 

which can be numerically evaluated through a forward pass 



of the BCJR algorithm |23) , |22 §5.3]. By using long enough 
sequences X[ l and Y™ in the computation, the SIR can be 
obtained with an accuracy of 0(-5=). This is shown as the 
"SIR" curve in Fig. |6| from which we conclude that the upper 
bound [7™ d (p) in ( fllj ) is a good approximation for the SIR. 



B. First-order Markov Input process 

To explore the loss in the achievable rate due to independent 
uniformly distributed input, we let the source have memory. 
We consider a first-order binary Markov input process X. 
Taking a cue from the input symmetry of the Bernoulli state 
channel, we consider a symmetric binary Markov process X 
with P{X, = 1IJ5Q.! = 0} = P{X, = 0|Xi_i = 1} = 0. 

(2.) 

We denote this by X ~ M\ '(P), meaning that X is a 
binary (alphabet of size 2) Markov source with memory 1 
and transition parameter (3. 

Starting from Q, we can arrive at lower and upper bounds 
for the Symmetric Markov-1 Rate (MIR), Ca/i(/3, p), which 
we define as the maximum rate of information transfer when 

(2) 

X ~ M\ (/3). The lower bounds analogous to those for the 
SIR are 

C M i(Ap)>l-/i2(|)=ir(p)- 
which is the same as L™ d (p) in |6|, 

c M1 (M=ny)\ x ^ M?)w -^ 2 (p) 
i ™ 

> lim - y / H(Y l \Y{- 1 ,X^ 1 )~f3h 2 (p) 



h 2 (pp)-ph 2 (p)=Lr i W,p), 



and 



1 " 

C M iW,p)> lim - y2H(Yi\Y^\X^ 2 )~ (3h 2 (p) 

>(1-/3(1- P 2 ))\ 




1-/3P J 

+ f3ph 2 (pp) - ph 2 (p) = i 2 M1 (/?, P ). (12) 

The trivial upper bound analogous to Ul ud (p) in ([9| is 

Cmi(/3, p) < 1 — (3h 2 (p) = (7o' fl (/3, p). 
The upper bounds corresponding to JJ[ ud and L^ 1 " 4 are 
CmiW, p)<h 2 (l-(3 + 2/3 2 pp) - ^ft 2 (p) £ C^ 1 ^, p) 
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and 



C 



Mi (A P) < {/3 2 ^(1 - PP) + /3 2 (3 - /3)pp + (1 + /3)/3 2 } 



/3 2 /3(l-pp)+/3 3 pp + ^ \ 
V«l - pp) + £ 2 (3 - /3)pp + (1 + f3) f J 
+ {2/^(1 - pp) + /3 3 (1 - 2pp) + pf) 

' /3 2 fl(l-pp)+/? 3 pp + fig 2 \ 

-ph 2 {p) = U^{p,p). (13) 

Fig. [7] shows the contours of the bounds for Cm\ in ([12) and 
([13). (Only the tighter bounds, Lf^OS, p) and U$ n {[i, p) are 
shown.) As was the case for i.u.d. input, the tighter bounds 



x hi 



x h 




Fig. 7. Contours of lower (thin curves) and upper (thick curves) bounds for 
MIR given in {12} and fT3> . 

( fT2) and ( fT3) have almost coinciding contours for a wide 
range of parameters (/?, p). Unlike the case of i.i.d. inputs, the 
rate-maximizing input parameter /3*(p) is not easily obtained. 
A close estimate can be obtained by maximizing the bounds 
obtained above. These are shown (dotted lines) in Fig. [7] Since 
the optimal j3 values for the tighter lower and upper bounds 
I j 2 I1 (/3,p) and C/^ l/1 (/3, p) almost coincide, we can say that 
/3*(p) is monotonically decreasing in p in the interval [0, h] 
with /3*[Q) = \ and /3* (1/2) « 0.31. 

Fig. ^compares the SIR (solid line representing L 2 ud (p) 
in (pj), and dashed line representing C/| (p) in ( [TT| ) and the 
MIR (sohd Une for L%* l (Pi a (p),p) in dashed line for 
U2 I1 (f3lf 2 (p), p) in ( fTJ) ) over the range of p values. It is clear 
that considerable gains in reliable information transfer rate are 
possible by using an input with memory. In particular, note 
that whereas there is a range of p values for which the SIR 
is smaller than 0.5, the MIR is strictly larger than 0.5 V p. 
It is clear that for a sequence of input Markov processes of 
increasing orders, the achievable rates are non-decreasing. The 




Fig. 8. Comparison between the tighter lower and upper bounds for SIR and 
MIR. 



algorithm suggested in [24|, |25| can be employed to optimize 
the input Markov process to maximize the rate. 



V. Binary Markov State Channel 



defined in Section 



III-B2 



The channel sp ace for the binary Markov state channel 
_ is (pi,p d ) € [0,1] 2 . Note that the 
i.e., the first parameter is the — > 1 





channel space is ordered 

transition (insertion) probability and the second the 1 
transition (deletion) probability. As in the case of the Bernoulli 
state channel, the capacity of the binary Markov state channel, 
denoted C(p,, pd), is given as 



C(Pi,Pd) 



■II 



lim sup -/ (pi . Pd) (AT;r»). 

°°p{x 1 "} n 



We shall assume that the channel has converged to the station- 
ary distribution. This means that when pj (resp., p^) is zero, 
the channel is the noise-free channel (resp., noise-free channel 
with a delay) and hence C(0, pd) = 1 (resp., C(p,, 0) = 1). We 
first establish the following symmetry property of the binary 
Markov state channel. 

Proposition 4 (Channel symmetry): C(p\, Pd) 
Proof: We know that if Z r { = (Z 1 ,Z 2 ,- 
Markov process, then so is (Z n , Z n _i, ■ ■ ■ ,Z\) 
Furthermore, since the channel is assumed to have converged 
to the stationary distribution, the conditional distributions 
F{Zi\Zi-i] and ¥{Zi\Z l+ i} are identical. However, note that 
whereas a transition Zi-\ = — > Zi = 1 is an insertion, the 
transition Z i+1 = — >• Zi = 1 is a deletion. This implies that 
I^iXr^lXt 1 ) = I (p6tPi) (S n - i+1 ;Yr\S^_ i+2 ), where 
S{ = Xi-i, for all but a vanishing fraction of indices i, as 



C(p d , Pi). 
, Z n ) is a 
16-4]. 



|26 
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i — y oo. Therefore 
C(pi, Pd 



1 



- lim sup -I^XfrYT) 

„->oo p{x „ } n 

1 " 

= lim sup T/wftlTI^ 1 ) 



lim sup 



1 



pi Tl — 

1 



= lim sup -I (pilPI )(S?;l?) 
= lim sup -/(p^jfir 1 ;^) 

n^oo p{x „ } 71 

= lim sup -7 (pd!Pi) (X 1 ™;F 1 ")=C(p d)Pi ). 

„->oo p{x „ } n 

The channel space can therefore be reduced to the region pj e 
[0, 1], Pd € [0, pi]. As was the case for the W(p) channel, we 
have the same symmetry for any fixed input distribution. ■ 
As a consequence of Proposition |4j we can assume an un- 
ordered pair {pi, Pd} parameterizing the channel space. 

As for the Bernoulli state channel, we can define the 
genie-erasure capacity C 9£ (pi, pd) of the binary Markov state 
channel VK(pi, Pd)- In this case, the resulting channel is a 
correlated erasure channel with an average erasure probability 
P{^_! = 1, Zi = 0} = JMtt_, so that 

PiPd A 



C(Pi,Pd) < 1 



Pd 



C g£ (pi, Pd) 



(14) 



0.5 



0.6 



Ciud(P\, Pd) 

0.7 0.8 



0.9 




Fig. 9. The SIR Ci u a(p\, Pd) for the Vy(pi, pd) channel. 

As a result of the memory in the Z process, it is con- 
siderably harder than it was for the W(p) channel to obtain 



closed-form expressions for lower bounds on the capacity of 
W(pi, Pd) by computing information rates for known input dis- 
tributions. However, the W(pj,Pd) channel is still an ergodic 
channel, and the SIR Ci U d(p;d, Pid) can be obtained numeri- 
cally. Fig. [9] shows the contours of the SIR for the W( P i, Pd) 
channel. Note that the symmetry proved in Proposition [4] is 
evident. 



0.8 



0.0 



0.4 - 



0.2 - 




Cge(Pid, Pid) 











, Pid) 





















0.2 



0.4 



0.0 



0.8 



Pid 



Fig. 10. The SIR Ci u d{p\d, Pid) and the genie-erasure capacity C SE (Pid > Pid) 
for the symmetric binary Markov state channel W^py, Pid)- 



In Fig. 10 we show the SIR for the case pj = p<j = p^, 
which we call the symmetric binary Markov state channel 
W^Pid, Pid)- The values of the SIR when p;d = and p;d = 1 
are easily explained. When pid = 0, the channel is noiseless. 
When pid = 1, the channel deterministically flips between 
the identity and the delayed channel so that every odd bit is 
repeated twice, and every even bit is lost, and the maximum 
achievable information transfer rate is h bit per channel use. 
Also shown is the genie-erasure capacity in (14) , which in this 
case becomes, 



C(pid, Pid) < Cge(Pid, Pid) 



1 



1, 



Ci u d(l, 1 

in Fig. [Tl 



_ Pid 
2 ' 

the SIR satisfies 
r. We also include 



Interestingly enough, when pid = 
C fle (l,l), so that C(l,l 
the SIR obtained for the channel W( P i, 1), as well as 
the corresponding genie-erasure capacity, C 9£ (pi,l) = y^Pp"- 



VI. if-ARY Markov State Channel 

We now consider a generalization of the binary Markov 
state channel as described in Q. Here, we allow Zi € TL = 
{0, 1, • • ■ , K — 1} and let Z be a first-order Markov process 
whose transition probabilities satisfy 

F{Z i = z\Z i _ 1 =z-l} = p i , 
P{Z J = z|Z 4 _ 1 =z + l} = p d , 

F{Zi = z\Zi_t = z) = 1 - pi - p d 
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Fig. 11. The SIR Cj u( j(pi, 1) and the genie-erasure capacity C 9e (pi, 1) for 
the channel W(pi, 1) with the Z process satisfying the (1, oo) constraint. 



for z £ {1, 2, • • • . K — 2}. Further, we have 

¥{Z t = l\Zi-x = 0} = Pi = 1 - F{Zi = 0\Zi-! = 0} 

and 

¥{Z l = K-2\Z l ^ 1 =K-l} = p d 

= 1 - P{Z l =K- = K - 1}. 

We denote this by Z ~ M.\ ' (pi, Pd). Note that when K = 2, 
this model gives the binary Markov state channel considered 
earlier. We will hence be interested in the K-aiy Markov state 
channel where K > 2, which we denote by W(p;,p d , K). 
Note that this channel now generalizes the binary Markov state 
channel in the sense that it allows up to [K — 1) consecutive 
insertions or deletions. 

We further assume that the parameters p; , pd are chosen 
such that the process Z is aperiodic and irreducible so that 
y is ergodic. The channel space is given by pj £ [0, 1], p<j € 
[0, 1 — pi]. The channel symmetry argument of Proposition [4] 
holds in this case also, so that 

C(pi,p d ,A") = C(p d ,Pi,.*0 

where C(p\, Pd, K) is the capacity of K-ary Markov state 
channel W(pi, Pd, K). Hence, the channel space can be fur- 
ther reduced to p; £ [0,1], Pd £ [0,min{pj,l — pj}]. As 
was the case for the binary Markov state channel, we have 
C(0, p d , K) = C( Pi , 0, K) = 1 V Pi , p d € [0, 1]. 

The genie-erasure capacity of the W(pi, Pd,^0 channel is 
given by the capacity of a correlated erasure channel with an 



K-i 



[Zi = Z^ - 1} = p {^-i =z,Zi = z-l} 

z=l 

K-l K 

= p {Z t -i = z}¥{Z. t = z - l\Zi- X = z}=J2 ^P d 



K-l 



z=l 



= Pd 1 - 



K-l 



K-l-z 




is the steady state 



where ir z — 

probability of Z% = z, so that 

C{p h pi,K) < 1- p e = C ge {p h pi,K). 
For the symmetric K-ary Markov state channel, we have 

K-l 

C(p id ,Pid,^) < 1- Pid K = C S e(Pid, Pid,-^) 

for pid £ [0, g] because in this case, -k z — -h V z £ 
{0, 1, • • • , K — 1}. Note that C ge (pid, P\i,K) reduces to the 
capacity of a BEC with erasure probability p;d, Cbec (Pid) = 
1 — pid, when K — > oo. 

The SIR of the W(p\,pd, K) channel can be obtained 
numerically as in the case of the W(pi, Pd) channel. However, 
the computational complexity of the BCJR algorithm |27] used 
to estimate the SIR increases roughly exponentially in the size 
K of the alphabet of the process Z, and hence the evaluation 
of the SIR for K > 2 may require considerable computing 
resources. We carried out the calculation for the case when 



K = 3, and the estimated SIR is shown in Fig. 12 



0.8 



0.6 





1 




i 1 

, Pid , 3) 


c 


c iud 


>id, Pid, 3) 
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i i i 



0.1 



0.2 



0.3 



0.4 



0.5 



Pid 



Fig. 12. The SIR Ci u d(p\d , Pid , 3) for the symmetric 3-ary Markov state 
channel, along with the genie-erasure capacity C g£ (p\d, pid,3). Also shown 
is the lower bound on the zero-error capacity Cq(Ws) from Corollary |6] 
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VII. Zero-error capacity 

The zero-error capacity Co of a discrete-memoryless chan- 
nel was introduced by Shannon J28J, from which it is readily 
seen that a noisy discrete-output binary-input memoryless 
channel has a zero-error capacity of 0, i.e., no information can 
be transmitted over such a channel with zero error. However, 
we will now show that the zero-error capacity of the noisy 
discrete-output binary-input channel in (JTJ is strictly positive. 
We will denote the generic channel in ([TJi for all binary Z 
processes by W. 

Proposition 5: Co(W) = \. 

Proof: Consider the 2-repetition code. The channel input 
process X and the message process satisfy the relationship 
X 2 i-i = M t ,X 2l = Mi for i = 1,2, Since X 2i -i = 
X 2 i, Y 2 i = X 2 i = Mi so that discarding the Y2i-is gives 
us *M exactly, thereby achieving zero error at a rate | . Thus, 

c (W)>h 

It is obvious that the capacity of a channel is an upper 
bound on the zero-error capacity. From (fHJ, we can see that 
(7(1,1) < |, and the coding scheme above achieves a rate 
0.5 for any realization of Z. Thus, (7(1,1) = \. Therefore, 
as long as the channel admits the infinitely long alternating 
sequence Z = 010101 • ■ ■ , we have C (W) = \. 

Note that both the Bernoulli state channel W(p) and the bi- 
nary Markov state channel W(pi, Pd) can generate the infinite 
sequence Z = 010101 • • • , albeit with vanishing probability. 
It is also worth noting here that this infinite sequence satisfies 
the (l,oo) constraint (See Section [Ill-C I. ■ 
Thus, unlike binary-input discrete memoryless channels, the 
binary-input discrete channel in ([TJ allows a non-zero infor- 
mation rate even under the severe requirement of zero-error. 
From this result and Fig. [8] it is clear that even under the 
milder condition of asymptotically vanishing error probability, 
random linear coset coding achieves a lower rate than the 2- 
repetition code, which guarantees zero errors, for the Bernoulli 
state channel over a range of p values. Referring to Fig. [9] 
the same can be said for the binary Markov state channels. 
However, by using a first-order Markov input (cf. Fig. [8}, 
higher rates than that of the zero-error achieving scheme are 
achievable for all Bernoulli state channels, although only with 
asymptotically vanishing error probability. 

The zero-error capacity of the W(p,, pd, K) channel, de- 
noted by Cq(Wk), satisfies the following bounds. 

Corollary 6: ± < C Q (W K ) < ^±1. 

Proof: Repeating every bit K times achieves zero error, 
since every K lh bit is always correct for any realization of the 
Z process. Hence, C (Wk) > ^ — C (Wk)- 

The smallest upper bound for the capacity is also an upper 
bound for the zero-error capacity. Thus, 

C (W K ) <minC(p hPd ,K) 
pi,Pd 

< min(7 !?£ (pi, Pd,K) 
Pi>Pd 

Observe that when K = 2, the minimum genie-erasure 
capacity occurs at p; = pd = 1 and at this point the upper 



bound C ge (l, 1, 2) is same as the lower bound C_ (W 2 ) which 
was used in the proof of Proposition [5] ■ 
Note that the bounds proposed above are loose asymptotically, 
i.e. 



i _> and f±i 



I as K 



VIII. Conclusions 

We proposed a new write channel model for bit-patterned 
media recording that reflects the data dependence of write 
synchronization errors. This model generates both substitution- 
like errors and insertion-deletion errors whose statistics are de- 
termined by an underlying channel state process. For Bernoulli 
and Markov channel state models, we studied information- 
theoretic limits imposed by the channel, computing bounds and 
numerical estimates for the maximum achievable information 
rate under different assumptions on the channel input statistics. 
We then generalized the Markov channel state model to allow 
a channel state space of size K > 2 and computed the SIR 
numerically for the case K = 3. Finally, we showed that 
the rate-i 2-repetition code achieves the zero-error capacity 
over the new write channel when the channel state space 
is binary. Bounds on the zero-error capacity of the general 
K-axy Markov channel state model were also presented. In 
future work, we plan to combine the new write channel model 
in cascade with the partial-response readback channel model 
and investigate the achievable rates, as was done for the i.i.d. 
insertion-deletion channel in fl6| . Furthermore, the channel 
model considered here can be readily represented using a 
factor graph and hence the construction of sparse graph-based 
codes with iterative decoding techniques seems possible. 
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